Refer to Online Appendix A for details of the two main data files, pat80_11.dta and cite80_11.dta. The Online Appendix describes the data file pat63_11.dta, but pat80_11.dta (which is simply a truncated version of pat63_11.dta) is sufficient for replication because our analysis is restricted to the period it covers. cite80_11.dta contains the pairwise citation data for the same period.


The data file pat80_11_16c.dta, which appears in the *.do files listed below, is identical to pat80_11.dta except that 1) the country variable is recoded (and replaced) so that every patent falls into one of the 16 "countries" that appear in the paper, and 2) it includes a new variable, p, that recodes the grant year variable into the 8 periods used in the paper.

The attached do-file, to16c.do, constructs pat80_11_16c.dta from pat80_11.dta.

There are four additional *.do files (run in Stata) and two *.g files (run in GAUSS):

1) table2_data.do constructs the *.dta file necessary to run the regression of Table 2.

2) table2_reg.do runs the regression of Table 2 using the *.dta file constructed by table2_data.do.

3) table3.do calculates the Hirsch index for each country/period cell.

4) table4_data.do constructs the *.raw files necessary to run the regression of Table 4.

5) table4_reg_1.g runs the regression of Table 4 for periods 1 and 2 using the *.raw files constructed by table4_data.do.

6) table4_reg_2.g runs the regression of Table 4 for periods 3 through 8 using the *.raw files constructed by table4_data.do.
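
For readers unfamiliar with the Hirsch index computed in item 3, the following is a minimal Python sketch of the standard definition applied per country/period cell: the largest h such that h patents in the cell each received at least h citations. It is not a translation of table3.do, which may differ in details such as how citations are counted. The function names, the tuple layout, and the toy data are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def h_index(citation_counts):
    """Return the largest h such that h items have at least h citations each."""
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def h_index_by_cell(patents):
    """Group (country, period, citations) tuples by cell and compute h per cell.

    The tuple layout is an assumption made for this sketch; it does not
    mirror the variable layout of the replication data files.
    """
    cells = defaultdict(list)
    for country, period, cites in patents:
        cells[(country, period)].append(cites)
    return {cell: h_index(counts) for cell, counts in cells.items()}


# Made-up toy data: (country, period, citations received)
toy = [
    ("US", 1, 10), ("US", 1, 3), ("US", 1, 1),
    ("JP", 1, 2), ("JP", 1, 2),
    ("JP", 2, 0),
]
result = h_index_by_cell(toy)
print(result)
```

In the toy data, the ("US", 1) cell has two patents with at least two citations each but not three with at least three, so its index is 2.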

